Overview of the NTCIR-10 SpokenDoc-2 Task

Authors

  • Tomoyosi Akiba
  • Hiromitsu Nishizaki
  • Kiyoaki Aikawa
  • Xinhui Hu
  • Yoshiaki Itoh
  • Tatsuya Kawahara
  • Seiichi Nakagawa
  • Hiroaki Nanjo
  • Yoichi Yamashita
Abstract

This paper presents an overview of the IR for Spoken Documents task at the NTCIR-10 Workshop. The task comprises two subtasks: spoken term detection (STD) and ad-hoc spoken content retrieval (SCR). Both subtasks aim to find terms, passages, and documents in academic oral presentations. The paper explains the data used in the task, how the transcriptions were produced by speech recognition, and the details of each subtask.

Similar resources

Spoken Document Retrieval Experiments for SpokenDoc-2 at Ryukoku University (RYSDT)

In this paper, we describe the spoken document retrieval systems of Ryukoku University, which participated in the NTCIR-10 IR for Spoken Documents (“SpokenDoc-2”) task. The NTCIR-10 “SpokenDoc-2” task has two subtasks: the “spoken term detection (STD) subtask” and the “ad-hoc spoken content retrieval (SCR) subtask”. We participated in the SCR subtask as team RYSDT. In this paper, our SCR systems are...

STD and SCR Techniques and Their Evaluations on the NTCIR-10 SpokenDoc-2 Task

This paper describes spoken term detection (STD) and spoken content retrieval (SCR) techniques and their evaluation on the NTCIR-10 SpokenDoc-2 task. First, we describe our STD technique, which uses a phoneme transition network (PTN) derived from the outputs of multiple speech recognizers, and its evaluation on the STD and iSTD (inexistent STD) tasks. Next, we introduce our SCR technique usin...

Spoken Document Retrieval Experiments for SpokenDoc at Ryukoku University (RYSDT)

In this paper, we describe the spoken document retrieval systems of Ryukoku University, which participated in the NTCIR-9 IR for Spoken Documents (“SpokenDoc”) task. The NTCIR-9 “SpokenDoc” task has two subtasks: the “Spoken term detection (STD) subtask” and the “Spoken document retrieval (SDR) subtask”. We participated in both subtasks as team RYSDT. In this paper, first, our STD systems are de...

Spoken Document Retrieval by Contents Complement and Keyword Expansion Using Subordinate Concept for NTCIR-SpokenDoc

We report the results of investigating which of the hypernym and hyponym relationships matters most for retrieval keyword expansion. We also report the effect of keyword expansion and contents complement on spoken document retrieval in the SCR lecture retrieval task and the SCR passage retrieval task. Spoken Document Retrieval by contents complement and keyword expansion usi...

DTW-Distance-Ordered Spoken Term Detection and STD-based Spoken Content Retrieval: Experiments at NTCIR-10 SpokenDoc-2

In this paper, we report our experiments on the NTCIR-10 SpokenDoc-2 task. We participated in both the STD and SCR subtasks of SpokenDoc. For the STD subtask, we applied a novel indexing method, called metric subspace indexing, that we proposed previously. One distinctive advantage of the method is that it can output the detection results in increasing order of distance without using any predefined...

Publication date: 2013